I. INTRODUCTION

A. OVERVIEW

A common problem in various areas of operations research and applied statistics, e.g. in reliability and maintainability studies, is that of predicting from available data the probability that a future observation exceeds a given value. An example arising in nuclear plant reliability is the probability that a crucial repair or down time exceeds h (= 4) hours. Another problem is to predict the "100-year flood", or earthquake, etc. The latter problem is difficult because there will usually be far less than 100 years' worth of data to work with. Still another problem is that of predicting the probability of survival for h = 5 years for a cancer victim receiving a particular treatment.

The simplest formulation is to assume that the data are a random sample from a continuous probability distribution $F_X(x)$ with density $f_X(x)$. That is, the observed values $x_1, x_2, \ldots, x_n$ are independent realizations of independent, identically distributed random variables generically denoted by X. If one is also willing to assume that the mathematical form $F_X(x) = F_X(x; \theta)$ is known (e.g. log-normal, gamma, or another candidate), then one can proceed as follows:

(a) Estimate the possibly multidimensional parameter $\theta$ (e.g. $\theta$ could be $(\mu, \sigma^2)$, the population mean and variance of ln X for a log-normal model, estimated classically by $\overline{\ln x} = \hat\mu$ and $s^2_{\ln x} = \frac{1}{n-1}\sum_{i=1}^{n}(\ln x_i - \overline{\ln x})^2$).

(b) Quote the point estimate $F_X(x; \hat\theta)$ or, in the present case, $1 - F_X(h; \hat\theta)$ for the probability of survival beyond h.

(c) Utilize facts about the sampling distribution of $\hat\theta$ to find a standard error or confidence limits for $1 - F_X(h; \theta)$, the survival probability.

The basic assumption, then, is that the data can reasonably be assumed to be a random sample from a fixed distribution, the form of which is known. There are various ways in which such convenient assumptions can be violated. One obvious one is that the fixed-distribution idea is not justifiable (perhaps because of important detectable variation in the distribution from location to location, from plant to plant, from repair crew to repair crew, etc.). Another might be that some data points are missing: too-short ones (down times) are not written down or else are recorded incorrectly, and too-long ones are regarded as being so exceptional as never to recur, and hence are removed. Possible or likely departures from the basic assumption should be investigated. The raw data should be carefully examined in an exploratory spirit (see J. W. Tukey [Ref. 1]), e.g. by graphics, to check for departures from the basic "stationary" assumption. In this discussion we rule out such variations.

This paper gives an account and some evaluation of several different ways of accomplishing step (c) above (confidence limits for the probability of survival, or exceedance of time h, and related topics). It will discuss four different methods for attacking the estimation and confidence limits problem.

1. Mathematical Formulation

We shall assume that $(x_1, \ldots, x_n)$ are the complete times of repair (or down times), and that they are independent realizations of the generic random variable X, where Y = ln X is normally distributed with mean $\mu$ and variance $\sigma^2$, both unknown. This kind of assumption is often made in practice. It implies that the probability that a randomly selected future down or repair time exceeds h is given by the formula

$$ P(X > h) = 1 - F_X(h; \mu, \sigma^2) = \int_{(\ln h - \mu)/\sigma}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}}. \tag{1.1} $$



In practice, this formula is not immediately applicable when $\mu$ and $\sigma^2$ are unknown, but if we estimate

$$ \hat\mu = \frac{1}{n} \sum_{i=1}^{n} \ln x_i = \overline{\ln x}, \tag{1.2} $$

$$ \hat\sigma^2 = \frac{1}{n-1} \sum_{i=1}^{n} (\ln x_i - \overline{\ln x})^2, \tag{1.3} $$

we can go ahead and quote a point estimate; the latter depends on

$$ \frac{\ln h - \hat\mu}{\sqrt{\hat\sigma^2}}. \tag{1.4} $$



If we examine this quantity (the integration limit), it will be seen to be a single realization of a random variable written as

$$ \theta = \frac{H - \bar Y}{S}, \qquad H = \ln h, \tag{1.5} $$

where $\bar Y$ is $N(\mu, \sigma^2/n)$ and $S^2$ is $\sigma^2 \chi^2_{n-1}/(n-1)$, proportional to a chi-squared r.v., the latter being independent of $\bar Y$ by the convenient (log-)normal assumption. Now rewrite $\theta$ as

$$ \theta = \frac{(H - \mu) - (\bar Y - \mu)}{S}. \tag{1.6} $$

If $(H - \mu) = 0$ then $-\theta\sqrt{n}$ would be precisely distributed as a Student's t. On the other hand, in general

$$ -\theta\sqrt{n} = \frac{\sqrt{n}(\bar Y - \mu) - \sqrt{n}(H - \mu)}{S}. \tag{1.7} $$

If we write $\delta = \mu - H$ then

$$ -\theta\sqrt{n} = \frac{\sqrt{n}(\bar Y - \mu)/\sigma + \sqrt{n}\,\delta/\sigma}{S/\sigma} \tag{1.8} $$

has a known density function, that of the non-central t, which is conveniently expressed in terms of the non-centrality parameter

$$ \gamma = \frac{\sqrt{n}\,\delta}{\sigma}. $$

Classical methods exist for utilizing this to establish tolerance limits. In this paper a different approach is followed. We examine the performance of several convenient approximate methods for assessing the uncertainty in the simple point estimate (1.1), where estimates (1.2) and (1.3) are used for the parameter values. These methods are the Delta method (linearization), the Jackknife, and the Bootstrap, as well as a completely distribution-free (Bernoulli trials) method. Details now follow.



B. PURPOSE AND APPROACH

1 . Distribution-Free Approach 

In general, suppose we want to solve the problem of estimating the survival probability without any distributional assumption, other than that the observations are i.i.d. The simplest way is to use the binomial approach. If $(x_1, x_2, \ldots, x_n)$ denotes the i.i.d. sample of down or repair times, we can estimate $P(X > h)$, the survival probability, by means of

$$ \hat P[X > h] = \frac{\#\{x_i > h\}}{n} = \hat p. \tag{1.9} $$

Then we can set up a confidence interval for the survival probability (1.9) by making use of the fact that for large n the binomial distribution can be approximated by a normal distribution. An approximate $(1-\alpha)\cdot 100\%$ confidence interval for the binomial parameter p is given by

$$ \hat p - z_{\alpha/2} \sqrt{\frac{\hat p (1 - \hat p)}{n}} \le p \le \hat p + z_{\alpha/2} \sqrt{\frac{\hat p (1 - \hat p)}{n}}, \tag{1.10} $$

where $z_{\alpha/2}$ is the $(1 - \alpha/2)\cdot 100\%$ point of the tabled unit normal.
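The binomial interval (1.9)-(1.10) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `binomial_survival_ci`, the sample data, and the clipping of the limits to [0, 1] are not part of the original text.

```python
import math
from statistics import NormalDist


def binomial_survival_ci(times, h, alpha=0.05):
    """Distribution-free estimate of P(X > h) with a normal-approximation CI."""
    n = len(times)
    # (1.9): fraction of observations exceeding h
    p_hat = sum(1 for x in times if x > h) / n
    # (1 - alpha/2) * 100% point of the unit normal
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    # (1.10); clipping to [0, 1] is an added convenience, not in the text
    return p_hat, max(0.0, p_hat - half_width), min(1.0, p_hat + half_width)


# Hypothetical repair times in hours (made up for illustration)
times = [0.5, 1.2, 2.0, 3.1, 4.5, 6.0, 0.8, 5.2, 2.7, 9.4]
p_hat, lower, upper = binomial_survival_ci(times, h=4.0)
print(f"p_hat = {p_hat:.3f}, limits = ({lower:.3f}, {upper:.3f})")
```

With only ten observations the interval is very wide, which illustrates why the parametric methods that follow are of interest for small n.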

2. Maximum Likelihood Approach

We can assume, along with others, that repair time data come as a random sample from a log-normal population: Y = ln X, where Y is Normal($\mu$, $\sigma^2$). This assumption will be crucial in all three methods. Then the maximum likelihood estimates (M.L.E.) of the parameters are as stated before:

$$ \hat\mu = \bar y = \frac{1}{n} \sum_{i=1}^{n} y_i, \tag{1.11} $$

$$ \hat\sigma^2 = \frac{1}{n-1} \sum_{i=1}^{n} (y_i - \bar y)^2 = s_y^2. \tag{1.12} $$


Strictly speaking, $\hat\sigma^2$ in (1.12) is the M.L.E. multiplied by $n/(n-1)$ and is unbiased for $\sigma^2$. Furthermore, in repeated samples of size n we have that "exactly" (assuming the model is correct):

(i) $\hat\mu = \bar Y$ is Normal($\mu$, $\sigma^2/n$) (1.13)

(ii) $\hat\sigma^2 = s_y^2$ is $\sigma^2 \chi^2_{n-1}/(n-1)$, where $E[\hat\sigma^2] = \sigma^2$ and $\mathrm{Var}[\hat\sigma^2] = 2\sigma^4/(n-1)$ (1.14)

(iii) $\hat\mu$ and $\hat\sigma^2$ are statistically independent.
Thus for large n both $\hat\mu$ and $\hat\sigma^2$ tend to be close to their respective population values, guaranteeing a good approximation to the survival probability if model (1) is correct. Now according to the assumed model, the probability of exceeding h hours is

$$ P(X > h) = \int_{(\ln h - \mu)/\sigma}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}}. \tag{1.15} $$

The maximum likelihood estimate of this probability is obtained by replacing $\mu$ by $\hat\mu$ and $\sigma^2$ by $\hat\sigma^2$:

$$ \hat P(X > h) = \int_{(\ln h - \hat\mu)/\hat\sigma}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}}. \tag{1.16} $$
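As a hedged illustration of the plug-in estimate (1.16), the sketch below computes $\hat\mu$, $\hat\sigma$ and the estimated exceedance probability using only the Python standard library. The function name `lognormal_survival_estimate` and the sample values are assumptions made for this example.

```python
import math
from statistics import NormalDist, mean, stdev


def lognormal_survival_estimate(times, h):
    """Plug-in estimate of P(X > h) when ln X is assumed normal."""
    y = [math.log(x) for x in times]
    mu_hat = mean(y)        # (1.11)
    sigma_hat = stdev(y)    # square root of (1.12), divisor n - 1
    q_hat = (math.log(h) - mu_hat) / sigma_hat
    # (1.16): upper tail area of the unit normal beyond q_hat
    return q_hat, 1.0 - NormalDist().cdf(q_hat)


# Hypothetical repair times in hours (made up for illustration)
times = [0.5, 1.2, 2.0, 3.1, 4.5, 6.0, 0.8, 5.2, 2.7, 9.4]
q_hat, p_hat = lognormal_survival_estimate(times, h=4.0)
print(f"q_hat = {q_hat:.4f}, estimated P(X > 4) = {p_hat:.4f}")
```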



Now find upper and lower limits for the parameter

$$ q = \frac{\ln h - \mu}{\sigma}, \tag{1.17} $$

i.e. $\underline{q}$ and $\bar q$ are functions of the observations such that $\underline{q} \le q \le \bar q$ with prescribed probability $(1-\alpha)\cdot 100\%$, say 95%. These then translate into upper and lower limits on the probability of exceeding h:

$$ \int_{\bar q}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}} \le P(X > h) \le \int_{\underline{q}}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}}. \tag{1.18} $$

If we compute $\underline{q}$ and $\bar q$ from a sample, then, under the initial assumptions, we have the desired confidence limits for the probability of exceeding h.

3. Delta Method (DL) 

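The delta method is an approximate way of finding the distribution of $\hat q$. It is known that functions such as $\hat q$ are approximately normally distributed for "sufficiently large" n (see Cramer [Ref. 2]). We estimate q by

$$ \hat q = \frac{\ln h - \hat\mu}{\hat\sigma} \tag{1.19} $$

and use the "delta method", or method of linearization, or small errors, to estimate the variance:

$$ \mathrm{Var}[\hat q] \approx \left(\frac{\partial q}{\partial \mu}\right)^2 \mathrm{Var}[\hat\mu] + \left(\frac{\partial q}{\partial \sigma^2}\right)^2 \mathrm{Var}[\hat\sigma^2]. \tag{1.20} $$

There is no covariance term because of the (theoretical) independence of $\hat\mu$ and $\hat\sigma^2$; see (iii) above in section 2. This formula yields

$$ \frac{\partial q}{\partial \mu} = -\frac{1}{\sigma}, \quad \mathrm{Var}[\hat\mu] = \frac{\sigma^2}{n}, \quad \mathrm{Var}[\hat\sigma^2] = \frac{2\sigma^4}{n-1}, \quad \frac{\partial q}{\partial \sigma^2} = -\frac{\ln h - \mu}{2(\sigma^2)^{3/2}}, \tag{1.21} $$

so

$$ \mathrm{Var}[\hat q] \approx \frac{1}{\sigma^2} \cdot \frac{\sigma^2}{n} + \frac{1}{4} \cdot \frac{(\ln h - \mu)^2}{(\sigma^2)^3} \cdot \frac{2\sigma^4}{n-1} = \frac{1}{n} + \frac{1}{2} \cdot \frac{(\ln h - \mu)^2}{\sigma^2 (n-1)}, \tag{1.22} $$

which is estimated by

$$ \hat\sigma_q^2 = \frac{1}{n} + \frac{1}{2(n-1)} \cdot \frac{(\ln h - \hat\mu)^2}{\hat\sigma^2}. \tag{1.23} $$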
Assume $\hat q$ can be taken to be normal with mean q and variance $\hat\sigma_q^2$, and quote these approximate confidence limits:

$$ \bar q_{DL} = \hat q + z_{1-\alpha/2} \sqrt{\hat\sigma_q^2}, \tag{1.24} $$

$$ \underline{q}_{DL} = \hat q - z_{1-\alpha/2} \sqrt{\hat\sigma_q^2}. \tag{1.25} $$

This translates into the desired (but approximate) confidence limits for the probability of exceeding h:

$$ \int_{\bar q_{DL}}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}} \le P(X > h) \le \int_{\underline{q}_{DL}}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}}. \tag{1.26} $$

Several approximations have been made in the process described, and the validity, for moderate n, of such a relatively simple process must be checked. Notice that the exact distribution of $\hat q$ is non-central t under the basic model assumption. This approach replaces the non-central t by a convenient normal approximation.
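The delta-method limits (1.19)-(1.26) can be sketched as follows. This is a minimal illustration, not the original FORTRAN program; the function name `delta_survival_ci` and the sample data are assumptions. It uses the identity $(\ln h - \hat\mu)^2/\hat\sigma^2 = \hat q^2$ in (1.23).

```python
import math
from statistics import NormalDist, mean, variance


def delta_survival_ci(times, h, alpha=0.05):
    """Delta-method (DL) point estimate and limits for P(X > h)."""
    std_normal = NormalDist()
    y = [math.log(x) for x in times]
    n = len(y)
    q_hat = (math.log(h) - mean(y)) / math.sqrt(variance(y))   # (1.19)
    var_q = 1.0 / n + q_hat ** 2 / (2.0 * (n - 1))             # (1.23)
    z = std_normal.inv_cdf(1 - alpha / 2)
    q_low = q_hat - z * math.sqrt(var_q)                       # (1.25)
    q_high = q_hat + z * math.sqrt(var_q)                      # (1.24)

    def survival(q):
        return 1.0 - std_normal.cdf(q)

    # (1.26): the larger q limit gives the smaller survival probability
    return survival(q_hat), survival(q_high), survival(q_low)


# Hypothetical repair times in hours (made up for illustration)
times = [0.5, 1.2, 2.0, 3.1, 4.5, 6.0, 0.8, 5.2, 2.7, 9.4]
p_hat, p_lower, p_upper = delta_survival_ci(times, h=4.0)
print(f"P(X > 4) ~ {p_hat:.3f}, limits = ({p_lower:.3f}, {p_upper:.3f})")
```

Note that the limits are computed on the q scale and then mapped through the normal tail area, so the resulting interval for the probability is generally not symmetric about the point estimate.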

4. Jackknife Method (JK)

The jackknife is an alternative way of putting confidence limits on the parameter

$$ q = \frac{\ln h - \mu}{\sigma}. $$

For further discussion see Mosteller and Tukey [Ref. 3] and Efron [Ref. 4]. In brief, the jackknife method has the capacity to reduce the bias of estimates of such quantities and, more importantly, to furnish confidence limits that behave in a satisfactory manner.

Jackknife estimates and confidence limits are constructed by successively leaving out parts of the available data to construct pseudovalues. These are then averaged, and the stability of the average is assessed by use of Student's t or the Normal in order to obtain confidence limits. The procedure is given below for our case:

(1) Form the estimate

$$ \hat q_n = q_n(y_1, y_2, \ldots, y_n) = \frac{\ln h - \hat\mu}{\hat\sigma}. \tag{1.27} $$

This is the m.l.e. using all the data, just as before.

(2) Form the estimates $\hat q_{(n-1),i} = q_{n-1}(y_1, y_2, \ldots, y_{i-1}, y_{i+1}, \ldots, y_n)$, $i = 1, 2, \ldots, n$; these are similar to $\hat q_n$, but omit successively each single observation $y_1, y_2, \ldots, y_n$; at the next stage each observation is then restored and the following one taken out, as i runs from 1 to n, and thus there are n values $\hat q_{(n-1),i}$.

(3) Compute the pseudovalues as follows:

$$ u_i = n \hat q_n - (n-1) \hat q_{(n-1),i}, \qquad i = 1, 2, \ldots, n. \tag{1.28} $$

(4) Compute the mean and variance of the pseudovalues:

$$ \bar u = \frac{1}{n} \sum_{i=1}^{n} u_i, \tag{1.29} $$

$$ S_u^2 = \frac{1}{n-1} \sum_{i=1}^{n} (u_i - \bar u)^2. \tag{1.30} $$
(5) Approximate (accuracy increasing with increasing n) $(1-\alpha)\cdot 100\%$ confidence limits for q are given by:

$$ \underline{q}_{JK} = \bar u - \frac{S_u}{\sqrt{n}} \, t_{\alpha/2}(n-1) \le q \le \bar u + \frac{S_u}{\sqrt{n}} \, t_{\alpha/2}(n-1) = \bar q_{JK}, \tag{1.31} $$

where $t_{\alpha/2}(n-1)$ is the $(1 - \alpha/2)\cdot 100$ percent point of Student's t (the standard, central, distribution). Also we can use $z_{\alpha/2}$ as before as an option.

(6) This means that, with approximate $(1-\alpha)\cdot 100\%$ confidence, the probability of survival is between the two confidence limits that follow:

$$ \int_{\bar q_{JK}}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}} \le P(X > h) \le \int_{\underline{q}_{JK}}^{\infty} e^{-z^2/2} \, \frac{dz}{\sqrt{2\pi}}. \tag{1.32} $$

This procedure, based on the m.l.e., has been theoretically validated for large n. It competes with the delta method, but is somewhat more difficult to carry out.
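The jackknife steps (1)-(6) can be sketched as below. The sketch uses the normal percentage point, which the text allows as an option, because the Python standard library has no Student's t quantile; the t point would give somewhat wider limits. The names `q_statistic` and `jackknife_survival_ci` and the sample data are assumptions for illustration.

```python
import math
from statistics import NormalDist, mean, stdev


def q_statistic(y, log_h):
    """q = (ln h - mean) / sd for a sample of log times."""
    return (log_h - mean(y)) / stdev(y)


def jackknife_survival_ci(times, h, alpha=0.05):
    """Jackknife (JK) limits for P(X > h), normal-point variant."""
    std_normal = NormalDist()
    y = [math.log(x) for x in times]
    n = len(y)
    log_h = math.log(h)
    q_all = q_statistic(y, log_h)                              # (1.27)
    # (1.28): one pseudovalue per omitted observation
    pseudo = [n * q_all - (n - 1) * q_statistic(y[:i] + y[i + 1:], log_h)
              for i in range(n)]
    u_bar = mean(pseudo)                                       # (1.29)
    s_u = stdev(pseudo)                                        # root of (1.30)
    z = std_normal.inv_cdf(1 - alpha / 2)   # z used in place of t (assumption)
    half = z * s_u / math.sqrt(n)                              # (1.31)
    # (1.32): map the q limits to survival probabilities, order reversed
    p_lower = 1.0 - std_normal.cdf(u_bar + half)
    p_upper = 1.0 - std_normal.cdf(u_bar - half)
    return u_bar, p_lower, p_upper


# Hypothetical repair times in hours (made up for illustration)
times = [0.5, 1.2, 2.0, 3.1, 4.5, 6.0, 0.8, 5.2, 2.7, 9.4]
u_bar, p_lower, p_upper = jackknife_survival_ci(times, h=4.0)
print(f"jackknifed q = {u_bar:.4f}, limits = ({p_lower:.3f}, {p_upper:.3f})")
```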

5. Bootstrap Method (BT)

The bootstrap method (see Efron (1979) [Ref. 4]) is similar to the jackknife method, but differs in being a resampling procedure. The procedure is given below for our case:

(1) Calculate

$$ \hat q = q_n(y_1, y_2, \ldots, y_n) = \frac{\ln h - \hat\mu}{\hat\sigma}. \tag{1.33} $$

This is the m.l.e. using the original data, the same as before.

(2) Draw a "bootstrap sample" $y_1^*, y_2^*, \ldots, y_n^*$, using $y_1, y_2, \ldots, y_n$ as the basic distribution, each value having probability 1/n, and calculate

$$ u = q_n(y_1^*, y_2^*, \ldots, y_n^*) = \frac{\ln h - \bar y^*}{s_{y^*}}. \tag{1.34} $$

(3) Independently repeat step (2) a large number of times, B, obtaining "bootstrap replications" $u_i$, $i = 1, 2, \ldots, B$, and calculate

$$ \bar u = \frac{1}{B} \sum_{i=1}^{B} u_i, \tag{1.35} $$

$$ S_u^2 = \frac{1}{B-1} \sum_{i=1}^{B} (u_i - \bar u)^2. \tag{1.36} $$



(4) Approximate $(1-\alpha)\cdot 100\%$ confidence limits for q. Here are four different approaches:

(a) Non-Parametric Approach (BT1): Take the order statistics of the bootstrap sample

$$ u_{(1)} \le u_{(2)} \le \cdots \le u_{(B)}, $$

then let $J = (\alpha/2) B$ and take as confidence limits for q:

$$ \underline{q}_{BT} = u_{(J)} \le q \le u_{(B-J+1)} = \bar q_{BT}. \tag{1.37} $$

(b) Normal Approximation Approach (BT2): If we assume the bootstrap sample is approximately normally distributed, then approximate confidence limits for q can be set down:

$$ \underline{q}_{BT} = \bar u - z_{1-\alpha/2} S_u \le q \le \bar u + z_{1-\alpha/2} S_u = \bar q_{BT}. \tag{1.38} $$

(c) Bias-Adjusted Non-Parametric Approach (BT3): The bootstrap estimate of bias is

$$ \mathrm{BIAS} = \bar u - \hat q, $$

where $\hat q$ is the estimate of q from the original data. Confidence limits for this case:

$$ \underline{q}_{BT} = u_{(J)} - \mathrm{BIAS} \le q \le u_{(B-J+1)} - \mathrm{BIAS} = \bar q_{BT}. \tag{1.39} $$

(d) Bias-Adjusted Normal Approximation Approach (BT4): In this approach the confidence limits are:

$$ \underline{q}_{BT} = (\hat q - \mathrm{BIAS}) - z_{1-\alpha/2} S_u \le q \le (\hat q - \mathrm{BIAS}) + z_{1-\alpha/2} S_u = \bar q_{BT}. \tag{1.40} $$

Some simulation results for these four approaches will be presented in the analysis section. These cases are identified as BT1, BT2, BT3, and BT4.
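The four bootstrap variants can be sketched together, since they share the same replications. This is an illustrative sketch only: the function name `bootstrap_q_limits`, the seed handling, the choice J = round(alpha/2 * B), the B - 1 divisor, and the skipping of degenerate resamples (all values equal, so the standard deviation is zero) are assumptions not taken from the text.

```python
import math
import random
from statistics import NormalDist, mean, stdev


def bootstrap_q_limits(times, h, B=200, alpha=0.05, seed=0):
    """Return (lower, upper) limits for q under the BT1..BT4 approaches."""
    rng = random.Random(seed)
    y = [math.log(x) for x in times]
    n, log_h = len(y), math.log(h)

    def q_of(sample):
        return (log_h - mean(sample)) / stdev(sample)

    q_hat = q_of(y)
    reps = []
    while len(reps) < B:
        resample = rng.choices(y, k=n)      # each y_i has probability 1/n
        if max(resample) > min(resample):   # skip degenerate resamples
            reps.append(q_of(resample))
    reps.sort()                             # order statistics u_(1)..u_(B)
    u_bar, s_u = mean(reps), stdev(reps)
    bias = u_bar - q_hat
    z = NormalDist().inv_cdf(1 - alpha / 2)
    J = max(1, round(alpha / 2 * B))
    low_pct, high_pct = reps[J - 1], reps[B - J]   # u_(J), u_(B-J+1)
    return {
        "BT1": (low_pct, high_pct),
        "BT2": (u_bar - z * s_u, u_bar + z * s_u),
        "BT3": (low_pct - bias, high_pct - bias),
        "BT4": (q_hat - bias - z * s_u, q_hat - bias + z * s_u),
    }


# Hypothetical repair times in hours (made up for illustration)
times = [0.5, 1.2, 2.0, 3.1, 4.5, 6.0, 0.8, 5.2, 2.7, 9.4]
limits = bootstrap_q_limits(times, h=4.0)
for name, (q_low, q_high) in limits.items():
    # Survival limits are the normal tail areas, in reversed order
    p_low = 1.0 - NormalDist().cdf(q_high)
    p_high = 1.0 - NormalDist().cdf(q_low)
    print(f"{name}: P(X > 4) in ({p_low:.3f}, {p_high:.3f})")
```

By construction BT3 has the same width as BT1 and BT4 the same width as BT2 on the q scale; the bias adjustment only shifts the interval.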
II. SIMULATION PROCEDURE

A simulation procedure has been used to compare the three methods for obtaining confidence intervals for the probability that a repair time exceeds h = 4 hours, both in the case in which the log-normal assumption is met and in other cases in which it is not (the exponential and long-tailed exponential). Specifically, simulation has been used to compute

(a) The actual coverage of the true survival probability by the confidence intervals given by the procedures under study, when the nominal coverage is $(1-\alpha)\cdot 100\%$.

(b) Measures of confidence interval size: the expected width and the standard deviation of the width.

The simulation programs were written in FORTRAN IV, and the simulations were carried out on the IBM 3033 at the Naval Postgraduate School. The Naval Postgraduate School LLRANDOM package was used, along with the International Mathematical and Statistical Library (IMSL) random generator; 1000 replications were used to evaluate each procedure in each distributional situation. Also, B = 200 bootstrap replications were taken for each trial. Four sample sizes (n = 10, 20, 30, 40), h = 4 hours, and three distributions (log-normal, exponential, and a long-tailed exponential) were investigated.

An outline of the simulation procedure now follows:

(A) Log-normal: In this case the basic variables, the down or repair times, are i.i.d. log-normal (see Appendix B). This case was simulated for the following "population" parameter values:

(1) $\mu = 1.$, $\sigma^2 = 1$
(2) $\mu = 1.$, $\sigma^2 = \ln 2$
(3) $\mu = 1.$, $\sigma^2 = 3 \ln 2$
(4) $\mu = 1.$, $\sigma^2 = (\ln 2)/3$

The procedures of the previous section were used to obtain confidence intervals for the probability that a repair time exceeds h = 4 hours.
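A coverage experiment of the kind described in (a) and (b) can be sketched for the delta method in the log-normal case. This is a sketch in the spirit of the procedure, not a reproduction of the original FORTRAN runs: the function names, the seed, and the use of Python's `random.gauss` generator are assumptions, so the numbers it prints will differ somewhat from the tabulated ones.

```python
import math
import random
from statistics import NormalDist, mean, stdev

STD_NORMAL = NormalDist()


def delta_q_limits(y, log_h, z):
    """Delta-method limits for q from a sample of log times."""
    n = len(y)
    q_hat = (log_h - mean(y)) / stdev(y)
    half = z * math.sqrt(1.0 / n + q_hat ** 2 / (2.0 * (n - 1)))
    return q_hat - half, q_hat + half


def simulate_delta_coverage(n, mu=1.0, sigma=1.0, h=4.0,
                            reps=1000, alpha=0.05, seed=0):
    """Estimate coverage, mean width and s.d. of width over `reps` trials."""
    rng = random.Random(seed)
    z = STD_NORMAL.inv_cdf(1 - alpha / 2)
    log_h = math.log(h)
    q_true = (log_h - mu) / sigma
    hits, widths = 0, []
    for _ in range(reps):
        y = [rng.gauss(mu, sigma) for _ in range(n)]   # log repair times
        q_low, q_high = delta_q_limits(y, log_h, z)
        # Covering q is equivalent to covering the true survival probability
        hits += q_low <= q_true <= q_high
        # Width of the interval for P(X > h)
        widths.append(STD_NORMAL.cdf(q_high) - STD_NORMAL.cdf(q_low))
    return hits / reps, mean(widths), stdev(widths)


coverage, avg_width, sd_width = simulate_delta_coverage(n=10)
print(f"coverage = {coverage:.3f}, average width = {avg_width:.4f}, "
      f"s.d. of width = {sd_width:.4f}")
```

The same loop can be reused for the other methods by swapping the function that produces the limits for q.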

(B) Stretched long-tailed exponential: Down or repair times come independently from a stretched long-tailed exponential distribution; see Appendix B. The simulated data were treated as if they were a sample from a log-normal distribution, and the procedures of the previous section were carried out. In this simulation we used the log transformation to move the long-tailed exponential observations towards normality (to symmetrize them). The stretched long-tailed exponential model is:

$$ X = A Z (1 + C Z), \tag{2.1} $$

where Z has the Exp(1) distribution. Simulation was carried out for A = 3.225, C = 0.1948. These values were taken in order to compare the results with the Exponential ($\lambda = 0.22$) case.
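Model (2.1) is easy to simulate, and its exceedance probability has a closed form because X is an increasing function of Z. The sketch below draws from the model and compares the sample mean and exceedance fraction with their theoretical values; the function names, seed and sample size are assumptions for illustration.

```python
import math
import random

A, C = 3.225, 0.1948   # parameter values quoted in the text


def draw_stretched(rng, a=A, c=C):
    """One draw of X = A*Z*(1 + C*Z) with Z ~ Exp(1), as in (2.1)."""
    z = rng.expovariate(1.0)
    return a * z * (1.0 + c * z)


def true_exceedance(h, a=A, c=C):
    """P(X > h) = exp(-z_h), z_h the positive root of a*c*z^2 + a*z - h = 0."""
    z_h = (-a + math.sqrt(a * a + 4.0 * a * c * h)) / (2.0 * a * c)
    return math.exp(-z_h)


rng = random.Random(1)
sample = [draw_stretched(rng) for _ in range(200_000)]
sample_mean = sum(sample) / len(sample)
# E[Z] = 1 and E[Z^2] = 2 for Exp(1), so E[X] = A * (1 + 2*C)
theoretical_mean = A * (1.0 + 2.0 * C)
empirical_exceedance = sum(1 for x in sample if x > 4.0) / len(sample)

print(f"mean: sample {sample_mean:.3f}, theory {theoretical_mean:.3f}")
print(f"P(X > 4): sample {empirical_exceedance:.4f}, "
      f"theory {true_exceedance(4.0):.4f}")
```

The closed-form exceedance probability is what a coverage experiment needs as the "true" value against which the confidence intervals are checked.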

(C) Exponential down or repair time: In this situation simulations were carried out for two cases. In the first, the logs of the exponential down or repair times were taken and treated as having the normal distribution; that is, the data were treated as log-normally distributed. In the second, the data values were raised to the power p, and the transformed values were then treated as normally distributed. Appendix C gives an algorithm for estimating the p value from the data. Simulations were also carried out for p = 0.33 (the classical Wilson-Hilferty value), and for three $\lambda$ values.
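For exponential data the true exceedance probability is $e^{-\lambda h}$, and both transformations lead to the same kind of standardized limit q. The sketch below illustrates this; the three rate values are the ones that appear in the table headings (0.22, 0.13, 0.26), and the function names and sample data are assumptions for illustration.

```python
import math
from statistics import NormalDist, mean, stdev


def exponential_survival(lam, h=4.0):
    """True P(X > h) for an exponential repair time with rate lam."""
    return math.exp(-lam * h)


def transformed_q(times, h, p=None):
    """q = (g(h) - mean g(x)) / sd g(x); g is log if p is None, else x**p."""
    g = math.log if p is None else (lambda x: x ** p)
    values = [g(x) for x in times]
    return (g(h) - mean(values)) / stdev(values)


for lam in (0.22, 0.13, 0.26):
    print(f"lambda = {lam}: P(X > 4.0) = {exponential_survival(lam):.4f}")

# Hypothetical repair times in hours (made up for illustration)
times = [0.5, 1.2, 2.0, 3.1, 4.5, 6.0, 0.8, 5.2, 2.7, 9.4]
for label, p in (("log", None), ("power 0.33", 0.33)):
    q_hat = transformed_q(times, 4.0, p)
    print(f"{label}: estimated P(X > 4) = {1.0 - NormalDist().cdf(q_hat):.4f}")
```

Because both transformations are increasing, P(X > h) equals the probability that the transformed variable exceeds the transformed h, so the normal tail area beyond q is the estimate in either case.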
III. ANALYSIS

The methods for obtaining 95% confidence intervals for the probability that a repair time exceeds h = 4 hours, described in chapter 1, were performed on simulated data having various distributions; these distributions were described in chapter 2. Simulation results for each method are shown in Tables 1 to 11.

Examining these tables case by case, we find the following results:

(A) Log-normal data: All three methods work very well for this case except BT1 and BT3, which seem to have consistently less than nominal coverage. The simple delta method exhibits good coverage, and always has relatively small average width and low standard deviation of width. It seems to work as well as JK and BT4 for small sample sizes (n = 10, 20). For large sample sizes all methods agree in their coverage except for BT1. The method BT4 always appears to exhibit over-coverage.

(B) Stretched long-tailed exponential data: Table 5 shows that JK and BT4 exhibit over-coverage when the sample size is n = 10. JK, DL, and BT4 appear to exhibit correct coverage for n = 20, 30, 40; the others do not, and at sample size n = 40 they are very poor. Also, coverage decreases for all methods as the sample size increases; the JK and BT4 methods have lower average width than does DL as the sample size increases. This is a result of the bias.

(C) Exponential case: Tables 6, 7, and 8 show that the log transformation may give very poor results for the exponential case, especially in Table 7. Tables 6 and 8 show that DL, JK, and BT4 work well for small sample sizes. Tables 9, 10 and 11 indicate that the power transformation works better than the log transformation. The JK, BT2, and BT4 methods always have better coverage than the DL method. Also, all methods agree in their coverage when the sample size increases, as was true for the log-normal case. Generally JK and BT4 exhibit acceptable coverage.
Table 1: Simulation results for log-Normal ($\mu = 1.$, $\sigma = 1.$) case, h = 4.0

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9420     0.4353     0.0409
         JK        0.9520     0.4985     0.1202
         BT1       0.8940     0.4683     0.0979
         BT2       0.9480     0.5105     0.1131
         BT3       0.9150     0.4857     0.0940
         BT4       0.9720     0.5380     0.1205
20       DL        0.9380     0.3221     0.0184
         JK        0.9560     0.3455     0.0510
         BT1       0.9060     0.3339     0.0461
         BT2       0.9400     0.3389     0.0452
         BT3       0.9290     0.3405     0.0469
         BT4       0.9620     0.3495     0.0491
30       DL        0.9450     0.2670     0.0114
         JK        0.9520     0.2795     0.0302
         BT1       0.9240     0.2748     0.0306
         BT2       0.9430     0.2737     0.9296
         BT3       0.9330     0.2772     0.0308
         BT4       0.9550     0.2804     0.0293
40       DL        0.9500     0.2329     0.0084
         JK        0.9560     0.2408     0.0215
         BT1       0.9230     0.2375     0.0246
         BT2       0.9360     0.2369     0.0227
         BT3       0.9460     0.2385     0.0234
         BT4       0.9610     0.2406     0.0226
Table 2: Simulation results for log-Normal ($\mu = 1.$, $\sigma = .83$) case, h = 4.0

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9423     0.4278     0.0468
         JK        0.9553     0.4931     0.1284
         BT1       0.8753     0.4464     0.1069
         BT2       0.9393     0.4952     0.1151
         BT3       0.9190     0.4717     0.1002
         BT4       0.9753     0.5352     0.1281
20       DL        0.9373     0.3165     0.0218
         JK        0.9520     0.3407     0.0567
         BT1       0.9133     0.3259     0.0506
         BT2       0.9383     0.3333     0.0499
         BT3       0.9283     0.3325     0.0506
         BT4       0.9600     0.3452     0.0538
30       DL        0.9450     0.2624     0.0137
         JK        0.9523     0.2755     0.0342
         BT1       0.9303     0.2681     0.0332
         BT2       0.9430     0.2704     0.0316
         BT3       0.9390     0.2713     0.0334
         BT4       0.9540     0.2764     0.0330
40       DL        0.9520     0.2283     0.0101
         JK        0.9533     0.2369     0.0246
         BT1       0.9313     0.2316     0.0250
         BT2       0.9443     0.2332     0.0240
         BT3       0.9470     0.2335     0.0253
         BT4       0.9560     0.2363     0.0249
Table 3: Simulation results for log-Normal ($\mu = 1.$, $\sigma = .48$) case, h = 4.0

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9433     0.3823     0.0700
         JK        0.9583     0.4621     0.1640
         BT1       0.3533     0.3525     0.1263
         BT2       0.9493     0.4404     0.1440
         BT3       0.9190     0.3944     0.1218
         BT4       0.9810     0.5133     0.1615
20       DL        0.9463     0.2813     0.0366
         JK        0.9523     0.3104     0.0771
         BT1       0.9323     0.2713     0.0630
         BT2       0.9383     0.2395     0.0646
         BT3       0.9290     0.2845     0.0633
         BT4       0.9530     0.3167     0.0714
30       DL        0.9460     0.2325     0.0241
         JK        0.9480     0.2483     0.0480
         BT1       0.9220     0.2275     0.0417
         BT2       0.9420     0.2363     0.0421
         BT3       0.9353     0.2345     0.0421
         BT4       0.9510     0.2503     0.0446
40       DL        0.9580     0.2022     0.0181
         JK        0.9560     0.2113     0.0349
         BT1       0.9260     0.1979     0.0317
         BT2       0.9480     0.2033     0.0315
         BT3       0.9450     0.2024     0.0322
         BT4       0.9550     0.2121     0.0333
Table 4: Simulation results for log-Normal ($\mu = 1.$, $\sigma = 1.44$) case, h = 4.0

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9420     0.4440     0.0324
         JK        0.9520     0.5045     0.1093
         BT1       0.8880     0.4351     0.0886
         BT2       0.9430     0.5169     0.0988
         BT3       0.9210     0.5027     0.0849
         BT4       0.9730     0.5417     0.1101
20       DL        0.9410     0.3285     0.0135
         JK        0.9550     0.3503     0.0423
         BT1       0.9180     0.3464     0.0410
         BT2       0.9400     0.3483     0.0394
         BT3       0.9290     0.3499     0.0413
         BT4       0.9620     0.3544     0.0421
30       DL        0.9440     0.2724     0.0081
         JK        0.9520     0.2862     0.0241
         BT1       0.9280     0.2825     0.0271
         BT2       0.9390     0.2922     0.0245
         BT3       0.9360     0.2941     0.0273
         BT4       0.9560     0.2849     0.0254
40       DL        0.9520     0.2375     0.0059
         JK        0.9600     0.2451     0.0165
         BT1       0.9390     0.2435     0.0209
         BT2       0.9480     0.2434     0.0186
         BT3       0.9490     0.2444     0.0210
         BT4       0.9570     0.2449     0.0191



Table 5: Simulation results for Stretched Long-tailed Exponential, h = 4.0, A = 3.225, C = 0.1948

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9530     0.4341     0.0350
         JK        0.9720     0.4973     0.1094
         BT1       0.8553     0.4404     0.1052
         BT2       0.9230     0.4632     0.1055
         BT3       0.9393     0.4563     0.1001
         BT4       0.9353     0.4893     0.1058
20       DL        0.9653     0.3195     0.0170
         JK        0.9590     0.3173     0.0499
         BT1       0.8913     0.3013     0.0485
         BT2       0.9173     0.3015     0.0460
         BT3       0.9203     0.3044     0.0476
         BT4       0.9590     0.3079     0.0453
30       DL        0.9573     0.2642     0.0110
         JK        0.9463     0.2493     0.0321
         BT1       0.9323     0.2424     0.0333
         BT2       0.9153     0.2415     0.0313
         BT3       0.9173     0.2443     0.0329
         BT4       0.9423     0.2445     0.0309
40       DL        0.9433     0.2301     0.0087
         JK        0.9163     0.2115     0.0245
         BT1       0.8753     0.2075     0.0260
         BT2       0.8940     0.2063     0.0244
         BT3       0.8953     0.2086     0.0258
         BT4       0.9243     0.2037     0.0241
Table 6: Simulation results for EXP($\lambda$) case using Log transformation, h = 4.0, $\lambda = 0.22$, P(X > 4.0) = 0.4148

Sample                        Average    Std. Dev.   Point
Size     Method    Coverage   Width      Width       Estimate
10       DL        0.9480     0.4432     0.0265      0.3731
         JK        0.9290     0.4476     0.1000      0.3639
         BT1       0.8450     0.4650     0.0989      0.3707
         BT2       0.9090     0.4308     0.1043      0.3707
         BT3       0.9010     0.4772     0.0974      0.3707
         BT4       0.9740     0.4975     0.1036      0.3763
20       DL        0.9520     0.3261     0.0127      0.3685
         JK        0.9060     0.3025     0.0473      0.3665
         BT1       0.9680     0.3133     0.0489      0.3670
         BT2       0.8840     0.3105     0.0457      0.3670
         BT3       0.8930     0.3151     0.0483      0.3670
         BT4       0.9310     0.3136     0.0445      0.3702
30       DL        0.9380     0.2697     0.0031      0.3661
         JK        0.8790     0.2416     0.0310      0.3651
         BT1       0.3530     0.2497     0.0338      0.3649
         BT2       0.3680     0.2467     0.0311      0.3649
         BT3       0.3810     0.2504     0.0335      0.3649
         BT4       0.8990     0.2431     0.0306      0.3674
40       DL        0.9010     0.2349     0.0065      0.3650
         JK        0.3280     0.2068     0.0237      0.3644
         BT1       0.8110     0.2124     0.0265      0.3641
         BT2       0.8160     0.2105     0.0245      0.3641
         BT3       0.8250     0.2128     0.0263      0.3641
         BT4       0.8440     0.2112     0.0241      0.3660
Table 7: Simulation results for EXP($\lambda$) case using Log transformation, h = 4.0, $\lambda = 0.13$, P(X > 4.0) = 0.5945

Sample                        Average    Std. Dev.   Point
Size     Method    Coverage   Width      Width       Estimate
10       DL        0.9100     0.4475     0.0298      0.5492
         JK        0.9010     0.5282     0.1199      0.5218
         BT1       0.8690     0.5064     0.0922      0.5718
         BT2       0.9110     0.5490     0.1102      0.5713
         BT3       0.9020     0.5280     0.0932      0.5713
         BT4       0.9410     0.5728     0.1213      0.5245
20       DL        0.9810     0.3329     0.0103      0.5385
         JK        0.3660     0.3774     0.0665      0.5250
         BT1       0.8970     0.3705     0.0462      0.5500
         BT2       0.8820     0.3740     0.0521      0.5500
         BT3       0.9010     0.3750     0.0502      0.5500
         BT4       0.8840     0.3796     0.0585      0.5267
30       DL        0.8470     0.2763     0.0057      0.5343
         JK        0.8390     0.3053     0.0409      0.5255
         BT1       0.8840     0.3031     0.0319      0.5421
         BT2       0.8600     0.3018     0.0322      0.5421
         BT3       0.3890     0.3048     0.0336      0.5421
         BT4       0.8460     0.3043     0.0354      0.5264
40       DL        0.7900     0.2411     0.0035      0.5315
         JK        0.7810     0.2615     0.0293      0.5251
         BT1       0.8330     0.2594     0.0266      0.5375
         BT2       0.8120     0.2594     0.0256      0.5375
         BT3       0.8370     0.2604     0.0276      0.5375
         BT4       0.7860     0.2607     0.0273      0.5255
Table 8: Simulation results for EXP($\lambda$) case using Log transformation, h = 4.0, $\lambda = 0.26$, P(X > 4.0) = 0.3535

Sample                        Average    Std. Dev.   Point
Size     Method    Coverage   Width      Width       Estimate
10       DL        0.9630     0.4322     0.0343      0.3197
         JK        0.9530     0.4188     0.0994      0.3225
         BT1       0.8460     0.4276     0.1072      0.3097
         BT2       0.9260     0.4478     0.1062      0.3097
         BT3       0.9060     0.4427     0.1021      0.3097
         BT4       0.9880     0.4726     0.1052      0.3311
20       DL        0.9660     0.3173     0.0171      0.3178
         JK        0.9380     0.2789     0.0458      0.3192
         BT1       0.8790     0.2859     0.0489      0.3124
         BT2       0.8980     0.2857     0.0459      0.3124
         BT3       0.9070     0.2341     0.0473      0.3124
         BT4       0.9500     0.2917     0.0447      0.3232
30       DL        0.9550     0.2621     0.0111      0.3161
         JK        0.9160     0.2228     0.0298      0.3174
         BT1       0.8700     0.2282     0.0332      0.3122
         BT2       0.8890     0.2271     0.0309      0.3122
         BT3       0.8960     0.2298     0.3327      0.3122
         BT4       0.9350     0.2301     0.0302      0.3199
40       DL        0.9370     0.2282     0.0088      0.3155
         JK        0.8840     0.1910     0.0229      0.3167
         BT1       0.8390     0.1947     0.0258      0.3127
         BT2       0.8510     0.1938     0.0241      0.3127
         BT3       0.8650     0.1956     0.0255      0.3127
         BT4       0.8943     0.1956     0.0236      0.3135
Table 9: Simulation results for EXP($\lambda$) case using $X^p$ transformation, h = 4.0, $\lambda = 0.22$, p = 0.33

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9303     0.4463     0.0302
         JK        0.9413     0.5039     0.0944
         BT1       0.8690     0.4941     0.0857
         BT2       0.9333     0.5234     0.0929
         BT3       0.9353     0.5101     0.0829
         BT4       0.9763     0.5459     0.0993
20       DL        0.9393     0.3307     0.0118
         JK        0.9610     0.3555     0.0363
         BT1       0.9153     0.3515     0.0356
         BT2       0.9453     0.3535     0.0335
         BT3       0.9353     0.3545     0.0359
         BT4       0.9633     0.3533     0.0360
30       DL        0.9410     0.2742     0.0069
         JK        0.9573     0.2888     0.0200
         BT1       0.9213     0.2875     0.0250
         BT2       0.9423     0.2374     0.0220
         BT3       0.9313     0.2887     0.0254
         BT4       0.9573     0.2897     0.0232
40       DL        0.9273     0.2393     0.0055
         JK        0.9443     0.2491     0.0133
         BT1       0.9203     0.2430     0.0193
         BT2       0.9373     0.2479     0.0163
         BT3       0.9273     0.2437     0.0194
         BT4       0.9440     0.2492     0.0167
Table 10: Simulation results for EXP($\lambda$) case using $X^p$ transformation, h = 4.0, $\lambda = 0.13$, p = 0.33

Sample                        Average    Std. Dev.
Size     Method    Coverage   Width      Width
10       DL        0.9350     0.4440     0.0291
         JK        0.9530     0.4911     0.0912
         BT1       0.8670     0.4804     0.0896
         BT2       0.9490     0.5133     0.0933
         BT3       0.9070     0.4963     0.0343
         BT4       0.9760     0.5377     0.1005
20       DL        0.9480     0.3282     0.0127
         JK        0.9670     0.3411     0.0351
         BT1       0.9150     0.3409     0.0403
         BT2       0.9430     0.3427     0.0372
         BT3       0.9350     0.3439     0.0403
         BT4       0.9700     0.3479     0.0385
30       DL        0.9510     0.2720     0.0084
         JK        0.9610     0.2752     0.0195
         BT1       0.9180     0.2775     0.0274
         BT2       0.9450     0.2770     0.0235
         BT3       0.9360     0.2783     0.0273
         BT4       0.9710     0.2792     0.0237
40       DL        0.9420     0.2372     0.0060
         JK        0.9520     0.2380     0.01^0
         BT1       0.9130     0.2385     0.0213
         BT2       0.9370     0.2386     0.0178
         BT3       0.9220     0.2392     0.0212
         BT4       0.9530     0.2399     0.0179
Table 11: Simulation results for EXP(λ) case using x^p transformation.

h = 4.0    λ = 0.26    p = 0.33

Sample                          Average    Std. Dev.
Size      Method    Coverage    Width      Width

10        DL        0.9270      0.4346     0.0420
          JK        0.9390      0.4970     0.1106
          BT1       0.8660      0.4677     0.1027
          BT2       0.9310      0.5032     0.1072
          BT3       0.9100      0.4892     0.0974
          BT4       0.9760      0.5415     0.1127

20        DL        0.9380      0.3224     0.0136
          JK        0.9570      0.3505     0.0482
          BT1       0.9060      0.3381     0.0436
          BT2       0.9420      0.3439     0.0423
          BT3       0.9290      0.3434     0.0435
          BT4       0.9510      0.3534     0.0453

30        DL        0.9380      0.2572     0.0117
          JK        0.9500      0.2345     0.0235
          BT1       0.9220      0.2731     0.0294
          BT2       0.9430      0.2905     0.0277
          BT3       0.9290      0.2805     0.0297
          BT4       0.9530      0.2851     0.0291

40        DL        0.9240      0.2323     0.0094
          JK        0.9390      0.2454     0.0198
          BT1       0.9150      0.2412     0.0225
          BT2       0.9290      0.2422     0.0207
          BT3       0.9210      0.2427     0.0226
          BT4       0.9410      0.2451     0.0212
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IV. EXAMPLE: APPLICATION TO OPERATIONAL DATA 



In this chapter four methods are applied to a real data set:
the Binomial (BN), Delta (DL), Jackknife (JK), and Bootstrap
(BT) methods (see section I-B). The data are recovery times
from loss of offsite power at nuclear plants. The problem is
to estimate the survival probabilities for h = 1.5, 2.0, 2.5,
3.0, 3.5, 4.0 (hours). The data points (n = 42) are listed in
Appendix E. We first applied several goodness-of-fit tests to
assess the adequacy of the log-normal distribution as a model
for these data. The results of these tests are as follows:

(1) Chi-square test: see Arnold, D. [Ref. 5]; this test
accepts the log-normal model at the significance level
α = 0.05.

(2) Kolmogorov-Smirnov test: see Arnold, D. [Ref. 5];
this test rejects the log-normal model at α = 0.05,
since the test statistic D = 0.21 exceeds the tabulated
value C = 0.1367.

(3) Shapiro-Wilk test: see Hahn, G. J. [Ref. 6]; this
test accepts the log-normal model at α = 0.05.
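As a rough illustration of test (2), the sketch below computes a Kolmogorov-Smirnov distance between the empirical distribution of ln(x) and a normal distribution fitted to it, using the 42 recovery times as they are transcribed in Appendix E. It is not the procedure of [Ref. 5]. The function name `ks_distance_lognormal` is ours, and comparing the tabulated C = 0.1367 with the large-sample approximation 0.886/sqrt(n) (the usual 5% point when the mean and variance are estimated from the data) is our own observation rather than something stated in the text.

```python
import math
from statistics import NormalDist, mean, stdev

# Recovery times (hours) as transcribed in Appendix E (n = 42).
RECOVERY_TIMES = [
    24.6160, 0.6166, 0.2666, 6.4660, 0.6666, 0.4833,
    0.1500, 0.0666, 0.3333, 25.6660, 11.0830, 1.5000,
    1.1833, 5.8000, 4.9830, 0.2833, 1.0000, 0.4333,
    0.1333, 0.0028, 0.9333, 2.6660, 4.7500, 0.0166,
    8.9000, 0.0041, 0.0038, 0.3333, 0.0333, 0.0500,
    1.8330, 0.5000, 0.9000, 0.1666, 0.0166, 5.5833,
    0.2333, 4.9833, 0.1333, 1.0166, 3.5000, 0.9166,
]


def ks_distance_lognormal(sample):
    """Largest gap between the empirical CDF of ln(x) and a fitted normal CDF."""
    logs = sorted(math.log(x) for x in sample)
    n = len(logs)
    fitted = NormalDist(mean(logs), stdev(logs))
    d = 0.0
    for i, value in enumerate(logs, start=1):
        cdf = fitted.cdf(value)
        # Compare the fitted CDF with the empirical CDF just after and just before the jump.
        d = max(d, i / n - cdf, cdf - (i - 1) / n)
    return d


n = len(RECOVERY_TIMES)
d_stat = ks_distance_lognormal(RECOVERY_TIMES)
critical = 0.886 / math.sqrt(n)  # assumed approximation to the 5% critical value
print(f"n = {n}, D = {d_stat:.4f}, approximate critical value = {critical:.4f}")
```

The model is rejected at the 5% level when D exceeds the critical value.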

We applied the four estimation methods to these data under
the log-normal assumption. The results are shown in
Table 12.
Table 12: Recovery Time Example Results (h in hours)

                  Upper Conf.    Point          Lower Conf.
h       Method    Limits         Estimation     Limits         Width

1.5     BN        0.476          0.333          0.191          0.285
        DL        0.445          0.324          0.219          0.226
        JK        0.432          0.324          0.230          0.202
        BT1       0.435          0.318          0.215          0.220
        BT2       0.438          0.318          0.214          0.224
        BT3       0.441          0.313          0.215          0.226
        BT4       0.451          0.330          0.224          0.227

2.0     BN        0.422          0.286          0.149          0.273
        DL        0.400          0.230          0.182          0.218
        JK        0.334          0.282          0.195          0.189
        BT1       0.379          0.279          0.193          0.130
        BT2       0.374          0.278          0.197          0.177
        BT3       0.381          0.278          0.198          0.182
        BT4       0.378          0.282          0.200          0.178

2.5     BN        0.422          0.236          0.149          0.273
        DL        0.366          0.249          0.155          0.211
        JK        0.348          0.250          0.169          0.179
        BT1       0.330          0.250          0.173          0.155
        BT2       0.345          0.250          0.171          0.174
        BT3       0.329          0.250          0.175          0.153
        BT4       0.342          0.247          0.169          0.173

3.0     BN        0.395          0.262          0.129          0.266
        DL        0.339          0.224          0.135          0.204
        JK        0.320          0.226          0.150          0.170
        BT1       0.309          0.223          0.135          0.174
        BT2       0.324          0.223          0.143          0.182
        BT3       0.311          0.223          0.135          0.176
        BT4       0.327          0.225          0.144          0.183

3.5     BN        0.395          0.262          0.129          0.266
        DL        0.318          0.204          0.120          0.198
        JK        0.298          0.207          0.135          0.163
        BT1       0.287          0.204          0.140          0.147
        BT2       0.289          0.204          0.136          0.152
        BT3       0.288          0.204          0.140          0.147
        BT4       0.290          0.205          0.137          0.153

4.0     BN        0.367          0.238          0.109          0.258
        DL        0.299          0.188          0.107          0.192
        JK        0.279          0.191          0.122          0.157
        BT1       0.263          0.195          0.113          0.150
        BT2       0.273          0.185          0.117          0.156
        BT3       0.267          0.185          0.113          0.154
        BT4       0.232          0.192          0.122          0.160
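The BN row for h = 1.5 can be checked by hand. The sketch below assumes that the Binomial method is the usual normal-approximation interval, p ± z sqrt(p(1 - p)/n), applied to the observed fraction of recovery times at or above h. Both that assumption and the function name `binomial_limits` are ours. The count used here, 14 of the 42 recovery times in Appendix E at or above 1.5 hours, comes from the listed data.

```python
import math
from statistics import NormalDist


def binomial_limits(k, n, level=0.95):
    """Normal-approximation limits for a proportion k/n (assumed form of the BN method)."""
    p_hat = k / n
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half_width = z * math.sqrt(p_hat * (1.0 - p_hat) / n)
    return p_hat - half_width, p_hat, p_hat + half_width


# 14 of the 42 recovery times are at or above h = 1.5 hours.
lower, point, upper = binomial_limits(14, 42)
print(f"upper={upper:.3f} point={point:.3f} lower={lower:.3f} width={upper - lower:.3f}")
# prints: upper=0.476 point=0.333 lower=0.191 width=0.285
```

Because this interval uses no distributional model for the recovery times, it depends on the data only through the count k, which is one way to see why it is wider than the log-normal based intervals.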
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V. CONCLUSIONS 



The Delta, Jackknife and Bootstrap methods applied to the
log-normal model work well when down or repair times are
truly log-normal. In particular, DL, JK, and BT4 seem to
work much better than BT1, BT2, and BT3. Recall that these
procedures do not appear to be sensitive to the population
variance; see section III. JK and BT4 are comparatively easy
to use when the sample size is small (n = 10, 20). The delta
method is always convenient, but especially when the sample
is large (n = 40 or more), because it is a very simple
procedure to apply and requires much less computation than
the others. As Table 12 shows, the Binomial method gives some
idea of the survival probability for practical purposes.

Note that the Binomial confidence limits are much wider than
those that assume the log-normal model (for example, a width
of 0.285 at h = 1.5, against 0.202 to 0.227 for the other
methods).

Use of the log transform on exponential data produces
biased estimates of survival. In our simulations the power
transformation x^p with p = 1/3 consistently gave better
coverage of the survival probability when the data are
exponential. One procedure for estimating the value of p from
data is described in Appendix C, and Table 13 gives simulation
results for the exponential case. As these results show, the
procedure does not estimate p correctly: the average estimate
falls from about 0.74 at n = 10 to about 0.42 at n = 150 and
stays above 1/3. If this procedure could be made to work
correctly (if it could be calibrated), then we could use the
x^p transformation (for converting data toward the normal
form) without making any distributional assumption (e.g. that
the data come from an exponential, gamma, or log-normal
distribution). After this is done, methods DL, JK, and BT4
might produce considerably better confidence limits for the
actual survival probability.
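The contrast between the log and the cube-root transformation on exponential data can be illustrated with a short sketch. It assumes that the plug-in estimate treats the transformed data as normal, so that P(X > h) is estimated by 1 - Φ((g(h) - mean)/sd) with g(x) = ln x or g(x) = x^p. The function name `survival_after_transform`, the sample size and the seed are ours, chosen only for illustration.

```python
import math
import random
from statistics import NormalDist, mean, stdev


def survival_after_transform(sample, h, p):
    """Plug-in P(X > h), treating x**p (or ln x when p == 0) as normally distributed."""
    def g(x):
        return math.log(x) if p == 0 else x ** p

    values = [g(x) for x in sample]
    z = (g(h) - mean(values)) / stdev(values)
    return 1.0 - NormalDist().cdf(z)


rng = random.Random(1)          # fixed seed so the run is repeatable
lam, h = 0.22, 4.0
sample = [rng.expovariate(lam) for _ in range(20000)]

truth = math.exp(-lam * h)      # exact exponential survival probability
est_log = survival_after_transform(sample, h, 0)
est_cube = survival_after_transform(sample, h, 1.0 / 3.0)
print(f"true = {truth:.4f}, log transform = {est_log:.4f}, p = 1/3 = {est_cube:.4f}")
```

With a sample this large the sampling error is small, so the remaining gap between each estimate and the true value is mostly the bias caused by the transformation.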
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APPENDIX A 



COMPUTER PROGRAMS 



The simulation programs consist of two main programs for
the three methods (DL, JK, BT). These main programs compute
confidence limits for the survival probability and score the
coverage for each replication. After 1000 replications, the
program computes summary statistics of these quantities and
prints out the results.
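The structure of such a simulation can be sketched in a few lines. This is not the thesis program. It assumes, without confirmation from the text shown here, that the jackknifed parameter is g = (ln h - μ)/σ and that its limits are mapped to survival limits through 1 - Φ(g). The names `g_from_logs`, `jackknife_limits` and `simulate_coverage`, the use of the normal multiplier 1.96, and the parameter values are all ours.

```python
import math
import random
from statistics import NormalDist, mean, stdev

PHI = NormalDist()


def g_from_logs(logs, log_h):
    """Standardized distance of ln(h) from the sample mean of the logs."""
    n = len(logs)
    m = math.fsum(logs) / n
    s = math.sqrt(math.fsum((v - m) ** 2 for v in logs) / (n - 1))
    return (log_h - m) / s


def jackknife_limits(sample, h, z=1.96):
    """Jackknife limits on g, mapped to limits on the survival probability."""
    logs = [math.log(x) for x in sample]
    n, log_h = len(logs), math.log(h)
    full = g_from_logs(logs, log_h)
    pseudo = [n * full - (n - 1) * g_from_logs(logs[:i] + logs[i + 1:], log_h)
              for i in range(n)]
    g_jk = mean(pseudo)
    se = stdev(pseudo) / math.sqrt(n)
    # Survival decreases in g, so the upper g limit gives the lower survival limit.
    return 1.0 - PHI.cdf(g_jk + z * se), 1.0 - PHI.cdf(g_jk - z * se)


def simulate_coverage(n, mu, sigma, h, n_rep=1000, seed=0):
    """Return (coverage, average width, std. dev. of width) over n_rep log-normal samples."""
    rng = random.Random(seed)
    truth = 1.0 - PHI.cdf((math.log(h) - mu) / sigma)
    hits, widths = 0, []
    for _ in range(n_rep):
        sample = [rng.lognormvariate(mu, sigma) for _ in range(n)]
        c_low, c_up = jackknife_limits(sample, h)
        hits += c_low <= truth <= c_up
        widths.append(c_up - c_low)
    return hits / n_rep, mean(widths), stdev(widths)


coverage, avg_width, sd_width = simulate_coverage(n=20, mu=1.0, sigma=1.0, h=4.0)
print(f"coverage = {coverage:.3f}, average width = {avg_width:.4f}, std. dev. = {sd_width:.4f}")
```

The three returned numbers correspond to the Coverage, Average Width and Std. Dev. Width columns of the simulation tables.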

There is another program, called SURVP. This program
computes point estimates, confidence limits, and widths of
the confidence limits for the survival probability, using the
BN, DL, JK, and BT procedures on a given data set under the
log-normal model assumption.

Variables List:

R     = Down or repair times.
R1    = Log of down or repair times.
RBAR  = Mean of down or repair times.
RSD   = Standard deviation of down or repair times.
GHAT  = Point estimate of the q parameter (see 1.17) for the delta method.
GJK   = Point estimate of the q parameter for the jackknife method.
GBOOT = Point estimate of the q parameter for the bootstrap method.
VARG  = Variance of the point estimate for the delta method.
SE    = Standard error of the point estimate for the jackknife method.
PHAT  = Point estimate of the survival probability.
BUP   = Upper confidence limit for the q parameter.
BLOW  = Lower confidence limit for the q parameter.
CUP   = Upper confidence limit for the survival probability.
CLOW  = Lower confidence limit for the survival probability.
AINT  = Width of the estimated confidence limits for the survival probability.
G     = Pseudovalues.
F     = Bootstrap replications.
N     = Number of data points.
N1    = Number of replications.
N2    = Number of bootstrap replications.
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THIS PROGRAM SIMULATES LOG-NORMAL CASE 
USING BT4 PROCEDURES. 
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APPENDIX B 



LOG-NORMAL AND STRETCHED LONG-TAILED 
EXPONENTIAL DISTRIBUTION 



(1) Let x be a log-normal random variable, that is,
$\ln(x) \sim N(\mu, \sigma^2)$. The k-th moment of x is
(see [Ref. 7])

$$E[x^k] = \exp\left(k\mu + \tfrac{1}{2} k^2 \sigma^2\right)$$

so

$$E[x] = \exp\left(\mu + \tfrac{1}{2}\sigma^2\right)$$

$$E[x^2] = \exp\left(2\mu + 2\sigma^2\right)$$

$$\mathrm{Var}[x] = (E[x])^2 \left(e^{\sigma^2} - 1\right)$$

and the squared coefficient of variation is

$$\frac{\mathrm{Var}[x]}{(E[x])^2} = e^{\sigma^2} - 1$$

(2) Let W be a stretched long-tailed exponential variable,
W = A z (1 + C z), where z is a unit exponential variable and
A and C are constants. The CDF of this distribution can be
written as

$$P(W \le w) = P[A z (1 + C z) \le w] = P[Z \le z(w)]$$

where z(w) is found by solving w = A z (1 + C z) for z:

$$z(w) = \frac{-A \pm \sqrt{A^2 + 4ACw}}{2AC}$$

(when A and C are positive, the root with the plus sign is the
relevant one, since z is non-negative). Since $E[z^k] = k!$
for a unit exponential variable,

$$E[w] = A(1 + 2C)$$

$$E[w^2] = A^2 \left[2 + 12C + 24C^2\right]$$

$$\mathrm{Var}[w] = A^2 \left[1 + 8C + 20C^2\right]$$

$$(CV)^2 = \frac{\mathrm{Var}[w]}{(E[w])^2} = \frac{1 + 8C + 20C^2}{1 + 4C + 4C^2}$$

If we look at w as a log-normal $(\mu, \sigma^2)$ variable then

$$\frac{1 + 8C + 20C^2}{1 + 4C + 4C^2} = e^{\sigma^2} - 1$$

We can get the value of C from this equation; then

$$E[w] = A(1 + 2C) = e^{\mu + \sigma^2/2}$$

gives the value of A.
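The two matching conditions can be solved in closed form. Writing t = e^{σ²} - 1, the first condition becomes the quadratic (20 - 4t)C² + (8 - 4t)C + (1 - t) = 0, and the second then gives A. The sketch below is one way to carry this out. The function name `stretched_exponential_constants` is ours, and the restriction 1 ≤ t < 5 is our own observation: the ratio (1 + 8C + 20C²)/(1 + 2C)² rises from 1 at C = 0 toward 5 as C grows, so no non-negative C exists outside that range.

```python
import math


def stretched_exponential_constants(mu, sigma):
    """Solve for (A, C) so that W = A*z*(1 + C*z) matches the log-normal mean and CV."""
    t = math.exp(sigma ** 2) - 1.0           # squared CV of the log-normal
    if not 1.0 <= t < 5.0:
        raise ValueError("no non-negative C matches this sigma")
    # Quadratic in C from (1 + 8C + 20C^2) = t * (1 + 4C + 4C^2).
    a, b, c = 20.0 - 4.0 * t, 8.0 - 4.0 * t, 1.0 - t
    big_c = (-b + math.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)
    # Match the means: A * (1 + 2C) = exp(mu + sigma^2 / 2).
    big_a = math.exp(mu + 0.5 * sigma ** 2) / (1.0 + 2.0 * big_c)
    return big_a, big_c


A, C = stretched_exponential_constants(mu=0.0, sigma=math.sqrt(math.log(3.0)))
cv_squared = (1.0 + 8.0 * C + 20.0 * C ** 2) / (1.0 + 2.0 * C) ** 2
print(f"A = {A:.5f}, C = {C:.5f}, squared CV = {cv_squared:.5f}")
```

In this usage example σ² = ln 3, so the target squared coefficient of variation is e^{σ²} - 1 = 2.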
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APPENDIX C 



POWER TRANSFORMATION 



The problem in applying the x^p transformation (toward the
normal form) is to find the value of p for given data. One
method for finding p is as follows [Ref. 8]. Let
$x_1, x_2, \ldots, x_n$ be the data points and M the median of
these data.

(1) Take the order statistics of the given data:

$$x_{(1)} < x_{(2)} < \cdots < x_{(n)}$$

(2) Then compute the values

$$q_j = 1 - p_j = \frac{\dfrac{x_{(n-j+1)} + x_{(j)}}{2} - M}{\dfrac{(x_{(n-j+1)} - M)^2 + (M - x_{(j)})^2}{4M}}, \qquad j = 1, 2, \ldots, [n/2]$$

(3) Take the median or the mean of the $q_j$:

$$\hat{q} = \mathrm{median}(q_j) \quad \text{or} \quad \hat{q} = \frac{1}{[n/2]} \sum_{j=1}^{[n/2]} q_j$$

(4) Then the value of p is obtained as $p = 1 - \hat{q}$.

Table 13 gives simulation results for this algorithm in the
exponential case. For the exponential case the best value is
p = 1/3. The simulation results do not give this value (the
average estimate is still about 0.42 at n = 150), so this
algorithm is not working correctly.
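The four steps can be sketched as follows. The sketch assumes that each q_j is the ratio of (x_(n-j+1) + x_(j))/2 - M to ((x_(n-j+1) - M)² + (M - x_(j))²)/(4M), which is our reading of the formula in step (2). The function name `estimate_power`, the seed and the sample size are ours. Pairs whose denominator is zero are skipped, which is an implementation choice and not part of the algorithm as stated.

```python
import random
from statistics import median


def estimate_power(data, use_median=True):
    """Estimate p for the x**p transformation from symmetric pairs of order statistics."""
    xs = sorted(data)
    n = len(xs)
    m = median(xs)
    qs = []
    for j in range(n // 2):
        lower, upper = xs[j], xs[n - 1 - j]
        denom = ((upper - m) ** 2 + (m - lower) ** 2) / (4.0 * m)
        if denom > 0.0:
            # Skewness of the pair about the median, scaled by its spread.
            qs.append(((lower + upper) / 2.0 - m) / denom)
    q_hat = median(qs) if use_median else sum(qs) / len(qs)
    return 1.0 - q_hat


# Symmetric data need no transformation, so the estimate is p = 1.
print(estimate_power([1.0, 2.0, 3.0, 4.0, 5.0]))

# One exponential sample, with the rate and sample size chosen for illustration.
rng = random.Random(0)
sample = [rng.expovariate(0.22) for _ in range(150)]
print(estimate_power(sample), estimate_power(sample, use_median=False))
```

The mean of the q_j is sensitive to the pairs closest to the median, where both numerator and denominator are small, so the median version is usually the steadier of the two.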
Table 13: Simulation results of the p algorithm using
Exponential data.

λ = 0.22

Sample     Average    Variance    Skewness
Size       of p       of p        of p

10         0.7434     0.9174      0.7535
20         0.6039     0.3507      1.3270
30         0.5336     0.2455      1.3619
40         0.5121     0.1872      1.8234
50         0.4903     0.1397      1.4981
60         0.4859     0.1122      1.4225
70         0.4658     0.0912      1.5475
80         0.4571     0.0814      1.6633
90         0.4498     0.0713      1.5987
100        0.4470     0.0669      1.8234
110        0.4403     0.0580      1.9513
120        0.4323     0.0515      1.4446
130        0.4298     0.0434      1.3154
140        0.4263     0.0452      1.4466
150        0.4196     0.0394      1.5704
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APPENDIX D 



MEAN SQUARE ERROR OF SURVIVAL PROBABILITY 

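Mean-square errors were calculated for the exponential case
using the log transformation, for three values of λ and
h = 4.0. The procedure is as follows:

(1) Generate an exponential sample $x_1, x_2, \ldots, x_n$,
n = 10, 20, ..., 250.

(2) Find the actual survival probability as:

$$P(x > h) = e^{-\lambda h}$$

(3) Estimate the survival probability (incorrectly, under the
log-normal model) as:

$$\hat{P}(x > h) = 1 - \Phi\left(\frac{\ln h - \hat{\mu}}{\hat{\sigma}}\right)$$

where $\hat{\mu}$ and $\hat{\sigma}$ are the sample mean and
standard deviation of ln x.

(4) Calculate $(\hat{P}(h) - P(h))^2$.

(5) Repeat this procedure 1000 times.

(6) Calculate the MSE as:

$$\widehat{\mathrm{MSE}} = \frac{1}{1000} \sum_{i=1}^{1000} \left[P(h) - \hat{P}_i(h)\right]^2$$

Simulation results for the mean square error of the survival
probability are shown in Tables 14, 15 and 16.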
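The procedure above can be sketched as a short simulation. This is not the thesis program. The function names, the seed, and the use of the n - 1 divisor for the standard deviation of ln x are our assumptions, so the numbers it prints need not agree exactly with Tables 14 to 16.

```python
import math
import random
from statistics import NormalDist


def lognormal_plugin_survival(sample, h):
    """Estimate P(X > h) as if the sample were log-normal (step 3)."""
    logs = [math.log(x) for x in sample]
    n = len(logs)
    m = math.fsum(logs) / n
    s = math.sqrt(math.fsum((v - m) ** 2 for v in logs) / (n - 1))
    return 1.0 - NormalDist().cdf((math.log(h) - m) / s)


def mse_of_survival(n, lam, h, n_rep=1000, seed=0):
    """Mean square error of the plug-in estimate over n_rep exponential samples."""
    rng = random.Random(seed)
    truth = math.exp(-lam * h)                      # step 2: exact survival probability
    total = 0.0
    for _ in range(n_rep):                          # step 5: repeat
        sample = [rng.expovariate(lam) for _ in range(n)]   # step 1
        total += (lognormal_plugin_survival(sample, h) - truth) ** 2   # step 4
    return total / n_rep                            # step 6


for n in (10, 50, 250):
    mse = mse_of_survival(n, lam=0.22, h=4.0)
    print(n, round(mse, 4), round(math.sqrt(mse), 4))
```

Because the log-normal model is wrong for exponential data, the MSE does not go to zero as n grows. It levels off near the squared bias of the estimate.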
Table 14: Mean Square Error of Survival probability
for EXP(λ) case using Log transformation.

h = 4.0    λ = 0.22

Sample     Mean Square     Square Root
Size       Error (MSE)     of MSE

10         0.0166          0.1288
20         0.0085          0.0920
30         0.0063          0.0795
40         0.0056          0.0746
50         0.0049          0.0699
60         0.0045          0.0673
70         0.0041          0.0642
80         0.0040          0.0635
90         0.0033          0.0614
100        0.0038          0.0612
110        0.0037          0.0604
120        0.0035          0.0594
130        0.0034          0.0585
140        0.0033          0.0578
150        0.0033          0.0575
160        0.0033          0.0572
170        0.0032          0.0564
180        0.0031          0.0560
190        0.0032          0.0562
200        0.0032          0.0563
210        0.0031          0.0561
220        0.0031          0.0557
230        0.0031          0.0553
240        0.0031          0.0557
250        0.0031          0.0553






Table 15: Mean Square Error of Survival probability
for EXP(λ) case using Log transformation.

h = 4.0    λ = 0.13

Sample     Mean Square     Square Root
Size       Error (MSE)     of MSE

10         0.0234          0.1531
20         0.0128          0.1132
30         0.0098          0.0991
40         0.0089          0.0942
50         0.0081          0.0897
60         0.0076          0.0869
70         0.0070          0.0834
80         0.0068          0.0824
90         0.0065          0.0803
100        0.0064          0.0802
110        0.0063          0.0792
120        0.0061          0.0781
130        0.0059          0.0770
140        0.0059          0.0765
150        0.0058          0.0762
160        0.0058          0.0758
170        0.0057          0.0753
180        0.0055          0.0745
190        0.0056          0.0746
200        0.0056          0.0747
210        0.0056          0.0747
220        0.0055          0.0744
230        0.0056          0.0745
240        0.0055          0.0743
250        0.0054          0.0737
Table 16: Mean Square Error of Survival probability
for EXP(λ) case using Log transformation.

h = 4.0    λ = 0.25

Sample     Mean Square     Square Root
Size       Error (MSE)     of MSE

10         0.0135          0.1160
20         0.0066          0.0812
30         0.0047          0.0633
40         0.0041          0.0637
50         0.0034          0.0585
60         0.0031          0.0559
70         0.0028          0.0525
80         0.0027          0.0519
90         0.0025          0.0495
100        0.0024          0.0492
110        0.0023          0.0483
120        0.0022          0.0472
130        0.0022          0.0465
140        0.0021          0.0456
150        0.0021          0.0453
160        0.0020          0.0443
170        0.0019          0.0439
180        0.0019          0.0435
190        0.0019          0.0437
200        0.0019          0.0438
210        0.0019          0.0435
220        0.0019          0.0431
230        0.0019          0.0431
240        0.0013          0.0429
250        0.0018          0.0425
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APPENDIX E 



DATA POINTS FOR EXAMPLE

These are recovery times (hours) from loss of offsite power
(LOSP) at nuclear plants:

24.6160     0.6166     0.2666     6.4660     0.6666     0.4833
 0.1500     0.0666     0.3333    25.6660    11.0830     1.5000
 1.1833     5.8000     4.9830     0.2833     1.0000     0.4333
 0.1333     0.0028     0.9333     2.6660     4.7500     0.0166
 8.9000     0.0041     0.0038     0.3333     0.0333     0.0500
 1.8330     0.5000     0.9000     0.1666     0.0166     5.5833
 0.2333     4.9833     0.1333     1.0166     3.5000     0.9166
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